SPACES WITH MANY AFFINE FUNCTIONS 

PETRA HITZELBERGER AND ALEXANDER LYTCHAK 

Abstract. We describe all metric spaces that have sufficiently 
many affine functions. As an application we obtain a metric char- 
acterization of linear-convex subsets of Banach spaces. 



1. Introduction 

Given a geodesic metric space X the existence of a non-constant 
affine function on X seems to impose strong restriction on the geometry 
of the space. Here a function / : X — > R is called affine if its restriction 
on each arclength parametrized geodesic 7 in X satisfies f(j(t)) = at+b 
for some numbers a, b that may depend on 7. For instance, if X is a 
complete Riemannian manifold such a function happens to exist if and 
only if X has a non-trivial Euclidean de Rham factor ( Inn82j) and the 
same is true in some more general spaces ([Mas02 j ,[AB05j). 

However, it is not clear what happens in the general situation. The 
expectation is that the existence of such a function forces the space X 
to look like a Banach space in some "direction" defined by the affine 
function, compare the main result in |LS04] and examples there. Thus 



it seems natural to expect that if X has sufficiently many affine func- 
tions it should look very similar to some Banach space. Our result 
confirms this expectation: 

Theorem 1.1. Let X be a geodesic metric space. Affine functions on 
X separate points of X if and only if X is isometric to a convex subset 
of a normed vector space with a strictly convex norm. 

Here we say that affine functions separate points of X if for each 
pair of points x, x G X there is an affine function / : X — > R with 
/(•'') / /(•'•)• 

It is possible to look on our theorem from another point of view. In 
order to do this we define a constant speed geodesic in a metric space 
to be a curve that has a constant speed and is globally minimizing 
between its endpoints. A map F : X — > Y between geodesic metric 



1991 Mathematics Subject Classification. 53C20. 

Key words and phrases. Affine functions, Banach spaces, geodesic mappings. 

1 



spaces is called affine if it sends each constant speed geodesic in X to 
a constant speed geodesic in Y. We say that F is an affine equivalence 
if F is bijective and F and F" 1 are affine. 

It is a natural question to which extent constant speed geodesic de- 
termine the metric on a space, i.e. given a geodesic metric space X 
what are all metric spaces Y that are affinely equivalent to the space 
X. If X and Y are complete Riemannian manifolds the answer to this 
question has been known for a long time ( |lnn82j ) and says that Y has 
a de Rham decomposition that consists of direct factors of X stretched 
by constants. 

Remark 1.1. If (still in the realm of Riemannian geometry), one relaxes 
the condition of affine equivalence to that of projective equivalence, i.e. 
if one only requires the geodesies of X and Y to coincide as subsets 
(unregarding the parametrisation) the question becomes much deeper. 
In fact it is a very old problem to which many partial answers are 
known, see, for instance, |MatQ3j and the literature there. 

If (while looking for affinely equivalent spaces) we still ask X to be 
a Riemannian manifold but allow Y to be arbitrary then the answer 
of |Inn82j must change. For example, the Euclidean space is affine 
equivalent to each normed vector space of the same dimension with 
a strictly convex norm. As a consequence of our result we see that 
nothing more can happen if X is Euclidean. Namely the notion of an 
affine function does not change under affine equivalence, hence from 
Theorem ll.il we derive: 

Corollary 1.2. Let C be a convex subset of a Euclidean space IR n . If 
Y is a geodesic metric space that is affinely equivalent to C then one 
can equip W 1 with a strictly convex norm such that C in this new norm 
is isometric to Y . 

Remark 1.2. In |Oht03j it is shown that if X is a Riemannian manifold 
and Y is affine equivalent to X, then the metric on Y is defined by a 
continuous Finsler metric. We think that our arguments can help to 
describe the possible Finsler metric precisely. 

The restriction to strictly convex norms in Theorem 11.11 seems arti- 
ficial. However, one should be careful when trying to drop this restric- 
tion. In fact each metric space has an isometric embedding in some 
(non-strictly convex) Banach space. On the other hand M 2 with the 
maximum norm ||(x,?/)|| = max{|x|, \y\} has too many geodesies to 
admit an affine function in the sense of the definition above. To avoid 
these difficulties one has to distinguish good (i.e. linear) geodesies from 
bad ones, thus we need the notion of a bicombing. 
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Let X be a metric space. A bicombing r on X is an assignment to 
each pair x, y of points in X a geodesic j xy connecting x and y, such 
that 'jyx is the geodesic ^ xy with the reverse orientation. Moreover, we 
require that for a point m on a geodesic j xy the geodesic ^ my is part 
of the geodesic j xy . We do not require that the geodesies j xy depend 
continuously on x and y. 

We say that a map F : X — > Y between two spaces with bicombings 
denote by the same letter T is T-amne if it for all x, x G X there are 
some numbers a, b such that f(j xx (t)) = lf(x)f{x)( a t + b) holds. 

Example 1.3. If X is a uniquely geodesic metric space, i.e. if between 
each two points of X there is precisely one geodesic, then we have a 
unique bicombing r on X and a I"- affine function on X is just an affine 
function in the sense of the old definition. 

Example 1.4. Each normed vector space has a natural bicombing that 
assigns to two points the linear interval between them (jxyit) — x + 
tyj|5f||)- All normed vector spaces will be considered with this bicomb- 
ing T in the sequel. With this definition each linear map between two 
normed vector spaces is a r-affine map. On the other hand each V- 
affine map F : V — > W between normed vector spaces has the form 
F(v) = F(0) + A(v) for some linear map A, i.e. F is affine in the usual 
sense of linear algebra. 

Using this notion we can now extend Theorem 11.11 

Theorem 1.3. Let X be a space with a bicombing V . Then the follow- 
ing are equivalent: 

(1) There is a Banach space B and an injective isometric T-ajfine 
map f : X —> B; 

(2) The r -affine functions separate the points of X; 

(3) There is a Banach space B and an injective Y -affine map f : 
X -> B. 

The equivalence between (2) and (3) can be reformulated as a metric 
characterization of linear convex subsets of Banach spaces: 

Corollary 1.4. Let X be a metric space. Then X is isometric to a 
linear convex subset of some Banach space if and only if X admits a 
bicombing V such that the V -affine functions separate points of X . 

A short comment on the regularity of affine functions. It is well 
known that an affine functions on a Riemannian manifold is smooth. 
However, in general spaces affine functions do not need to be even con- 
tinuous, for instance, there are many non- continuous linear functions 
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on Hilbert spaces. In our theorem we did not make any restrictions on 
the affine functions, but a combination of Theorem 11.31 and the Hahn- 
Banach theorem immediatly shows that for each geodesic metric space 
X Lipschitz continuous affine function separate points of X if arbitrary 
affine functions separate points of X. 

The paper is organized as follows. In Section El we show that Theo- 
rem ll.3l implies Theorem 1 1 . 1 1 and prove the straightforward implications 
(1) — > (2) — > (3) in Theorem 11.31 In Section |U we reduce the proof of 
the implication (3) — > (1) to the case where X is an open convex subset 
of lR n with some Finsler metric on it. Finally in Section |S] we discuss 
first variation formulas in this Finsler metric and prove that the Finsler 
structure is constant. 

We would like to express our gratitude to Werner Ballmann for his 
encouragement and support. We are grateful to Linus Kramer for help- 
ful comments. 

2. Preliminaries 

2.1. Vector spaces. Linear intervals in a vector space V are curves 
7 : [0, 1] — > V of the form •yit) — tv + (1 — t)w for some v, w £ V. A 
subset C of V is called linearly convex if it contains each linear interval 
7 as above for all v , w G C. Let C be a linearly convex subset of V 
that contains the origin 0. Then the linear hull H(C) of C (i.e. the 
smallest linear subset of V that contains C) is the set of all points x 
that can be represented as x = X(v — w) for some A > and some 
v,w 6 C. Let C C V be linearly convex subset of V that contains 
the origin and assume that V coincides with the linear hull H(C) of 
C . By the dimension of C we denote the dimension of the vector space 
V = H(C). If the dimension of C is finite, it is well known that the set 
O of inner points of C (with respect to the usual Euclidean topology of 
H(C)) is convex, non-empty and that the closure O of O in V contains 
C. 

2.2. Metric spaces. By d we will denote distances in metric spaces. A 
geodesic respectively a ray in a metric space X will denote an isometric 
embedding 7 : / — > X of an interval respectively of a half-line into X. 
Note that geodesies (if not otherwise stated) are parametrized by the 
arclength. 

A metric space is called geodesic if each pair of its points is connected 
by a geodesic. It is called uniquely geodesic if this connecting geodesic 
is unique. 

Given a ray 7 in a metric space (X, d) the Busemann function 0/7 is 
defined by b 7 (x) := \im t ^ OQ (d('y(t) , x) — cf(7(0), j(t))). The Busemann 
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function exists and is a 1-Lipschitz function on X. Moreover 6 7 (x) < 
d(y(t), x) - d(y(0), y{t)) for all t > 0. 

2.3. Basics on normed vector spaces. Let (V, || • ||) be a vector 
space with a norm and denote by d the induced metric on V. The 
norm is called strictly convex if for all linear independent v , w G V one 
has + < |M| + \\w\\. The norm || ■ || is strictly convex if and only 
if (V, d) is uniquely geodesic. 

For each non-zero vector h in V we denote by y h the ray y h (t) = tjf^- 
The Busemann functions bh '■— b^h of such linear rays have the following 
properties, that are direct consequences of the definition. 

(1) b h (v) = limbed \th-v\\ - \\th\\) = lim^ 0+ ( " ft - to j'-" ft " ); 

(2) b h (tv) = tb h (v), for all t > 0; 

(3) b h (vx + v 2 ) < b h (vx) + b h (v 2 ); 

(4) b h (v)=b. h {-v). 

We see that the Busemann function bh is linear if and only if the 
equality bh(v) = —bh(—v) holds for all v G V. However, this equality 
is equivalent to the following well known one: 

Definition 2.1. A point h G V is called smooth in the Norm || • || if 
the following holds for all v G V: 

Um \\h-tv\\-\\h\\ = _ Um \\h + tv\\ -\\h\\ _ 
t^o+ t t-*0+ t 

Thus the Busemann function bh is linear if and only if h is a smooth 
vector in the norm || • ||. If V is a finite dimensional vector space, then 
a vector h is smooth in the norm || • || if and only if the Lipschitz map 
|| ■ || : W 1 — > K has a linear differential at the point h. Due to the 
theorem of Rademacher we know that in a finite dimensional normed 
vector space V almost each vector is smooth with respect to the norm. 

3. Easy implications 

3.1. Strictly convex case. First we are going to derive Theorem ll.il 
from Theorem 11.31 

Let X be a space on which affine functions separate the points. Let 
x, x be two arbitrary points in X. For each geodesic y from x to x 
each affine function / has the value on the midpoint m of y. 

Thus all affine functions have the same value on the midpoint of each 
geodesic between x and x. From the separation assumption we deduce 
that X is uniquely geodesic. 

By Theorem 11.31 we know that X is isometric to a linearly convex 
subset of a Banach space B. We replace X by this subset and may 
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assume that it contains the origin 0. Denote by V its linear hull in 
B with the norm induced from the ambient Banach space B. Each 
element v of V has the form X(x — x) for some A > and some x, x G A. 

We claim that V is strictly convex. Assume the contrary and find 
linearly independent v,w G V with ||u + iu|| = ||i>|| + ||w||. We can find 
a finite dimensional linearly convex subset A of A that contains and 
whose linear hull V contains v and w. Due to finite dimensionality X 
contains an open ball in V and since V is not strictly convex, the subset 
X is not uniquely geodesic. But X is a convex subset of a uniquely 
geodesic space X and we arrive at a contradiction. 

The other direction is clear since linear functions are affine on a 
strictly convex normed vector space and separate its points (compare 
the next subsection). 

3.2. Linear algebra. We prove (1) — > (2) of Theorem II .31 here. Thus 
let (V, || • ||) be a normed vector space. Observe that each linear function 
on V is r-affine with respect to the natural bicombing. Since linear 
functions separate points of V (by linear algebra, you may also apply 
the Hahn-Banach theorem), we see that r-affine functions separate 
points of V. We deduce that r-affine functions also separate points on 
each space X that admits an injective r-affine map % : X — > V. 

3.3. Evaluation. Assume now (2) of Theorem 11.31 Consider the vec- 
tor space TAff(X) of all r-affine functions on X and denote by V its 
(algebraic) dual space (TAf f(X))* . Define the natural evaluation map 
E : X — > V by E(x)(f) = f(x) (compare LS04J). The map is well 
defined and by the separation property it is injective. 

We claim that E sends each T-geodesic onto a linearly reparametrized 
linear interval. 

In fact, let a geodesic ^ xy be given and reparametrize it to a constant 
speed geodesic ^ xy : [0, 1] — > X such that 

7xy(0) — x i 7xy(l) = V- For each t G [0, 1] and each affine function 
/ : X — > R we see f(% y (t)) = (1 — t)f(x) + tf(y). Hence for our map 
E we deduce E(% y (t)) = (l-t)E(x)+ tE(y). 

Now we equip V with an arbitrary norm. The linear intervals become 
linearly reparametrized T-geodesics for the natural bicombing V on V. 
Hence the map E : X — > V becomes r-affine. Taking B to be the 
completion of V we obtain (3) of Theorem 11.31 

4. Reduction 

In this section we reduce the implication (3) — > (1) of Theorem 11.31 
to the case where A is a Finsler manifold. 
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4.1. Reformulation. Let X be a space with a bicombing T that ad- 
mits an injective T-affine map i : X — > Z? into a Banach space 5. We 
forget the norm on S and identify X with its image C. We may assume 
that is contained in C and replace B by the linear hull V of C in £?. 

We have to prove that we can define a norm on V such that the 
induced metric on C coincides with the given one. Thus the implication 
we are looking for is implied by the following: 

Lemma 4.1. Let V be a vector space, C a linear convex subset of V 
that contains the origin 0. Let d be a metric on C such that the linear 
intervals contained in C are constant speed geodesies with respect to the 
metric d. Then there is a norm || ■ || onV such that the induced metric 
on C coincides with d. 

4.2. Reduction to finite dimension. Since norms defined on linear 
subspaces can be extended to norms on the whole space it is enough 
to prove that such a norm exist on the linear hull H(C) C V. Each 
element v G H(C) has the form v = X(x — x) for some x,x G C and 
some A > 0. Thus the norm we are looking for is unique, if it exists, 
and must be given by ||t>|| = Xd(x,x), where v has a presentation as 
above. 

Thus we need to prove that this is a well defined quantity (i.e. in- 
dependent of the representation of v) and that it defines a norm on 

Observe that each linearly convex subset C of C that contains and 
is considered with the metric induced by d also satisfies the assumptions 
of Lemma 14.11 We deduce that it is enough to prove Lemma 14.11 in 
the case where the dimension of C is finite. For instance, in order 
to see that || • || is well defined take a point v G H(C) that has two 
different presentations v = X(x — x) and v = X\{xi — X\). Consider 
the linear convex hull C of the five points 0. Assuming 

that Lemma 14.11 is true for the finite dimensional C we deduce that 
Xd(x,x) = Xid(xi,xi), hence || • || is well defined. The fact that || ■ || is 
a norm is shown in the same way. 

Thus it is enough to prove Lemma 14.11 in the case where C has a 
finite dimension. 

4.3. Reduction to open subsets. Let C be as in Lemma 14.11 and 
assume that the dimension of C is finite. Replacing V by the linear 
hull H(C) we may assume that V = M. n . If Lemma [4. II is true for the 
set O of inner points of C then by continuity it is also true for the 
closure O of O in MJ 1 and therefore it is also true for C G O. 
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Thus it is enough to prove Lemma I4~T1 in the case where V = M. n and 
where C is a convex open subset of W 1 . 

4.4. Reduction to a Finsler metric. Assume now that in Lemma l4~Tl 

we have V = MJ 1 and that C is open. We claim that the metric d on C 
is defined by a continuous Finsler structure on C . 

In fact it is more or less a special case of Theorem B in |Oht03| . We 
shortly recall the arguments for the convenience of the reader, since the 
assumptions in |Oht03j are slightly different. First of all one defines 
for each x G C and each v G M n the quantity |i>| ffi as d(x,x + ev)/e, 
for a sufficiently small positive real number e. Since j(t) = x + tv is 
a constant speed curve on the whole interval that is contained in the 
open set C, we deduce that the definition of |i>| x does not depend on 
the choice of e, and in fact it is just the speed of the curve 7. 

Moreover, we immediately deduce that \Xv\ x = |A| • |u| x . Since d is a 
metric we have \v\ x > with equality if and only if v = 0. 

We need to assure that the metric space (C, d) is locally compact, 
i.e. that the identity id : (C, dEud) — ► (C, d) is continuous. This follows 
directly, as soon as one knows that for given x G C the function | • \ x is 
bounded on compact subsets of MJ 1 (or equivalently on the unit ball). 
The last claim is the content of Proposition 2.1 in Oht03j. Fix now 
a small e and take a convergent sequence X{ — > x and a convergent 
sequence Vi — > v . We observe that Xi + eVi converge to x + ev , hence 
by definition \vi\ Xi must converge to \v\ x . Therefore the function \v\ x 
is continuous in v and x. From this we can derive that | • \ x is a norm. 
Namely, we have d(x, x+e(v+w)) < d(x,x+ev)+d(x+ev,x+ev+ew) = 
e\v\ x + e\w\ x+ev . Dividing by e we get in the limit \v + w\ x < \v\ x + \w\ x . 

Thus I • |a; is a continuous Finsler structure and defines a metric d on 
C . The conclusion that d coincides with d is a direct consequence of 
the fact that linear intervals have the same length with respect to the 
metric d and d. 

Thus we have shown that the metric d is given by a continuous 
Finsler structure | • \ x on C. 

5. Final step 

We are left with the following problem. Let O C M. n be an open 
linear convex subset. Let | • \ x be a continuous Finsler structure on 
O, such that for the induced metric d the linear intervals are constant 
speed geodesies. We need to show that there is a norm || ■ || on M. n such 
that the metric d on O induced by this norm coincides with our metric 
d. This conclusion follows as soon as we know that | • \ x = \ ■ \ y holds 
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for all x, y G O. The rest of this section is devoted to the proof of the 
last statement. 

First of all linear intervals have constant speed, hence for each igO, 
each v G M n and each t e R such that x + tv is still in O we have 
\v\ x = MaH-tu- Since linear intervals are geodesies we conclude that 
d(x,x + tv) = \tv\ x , i.e. for all x, y G O we have d(x,y) — \x — y\ x . 
(To avoid confusion: what we need is the much stronger statement 
d(y, z) = \y — z\ x for all x,y,z G 0\). 

Let us fix an arbitrary v G M. n . It is enough to show that \h\ x = \h\ x+v 
for almost all ftel" and all x, x + v G O. Here and below we consider 
MJ 1 , O the tangent bundle TO = Oxi™ and linear intervals equipped 
with natural Lebesgue measures. By abuse of notation we will say that 
h G M. n is smooth at a point x G O if h is a smooth vector of the norm 
| • \ x (compare Definition 12.1)1 . For each x G O almost all h G M. n are 
smooth at a;. Applying Fubini's theorem twice we find a set S of full 
measure in ~R n with the following property. For each vector h G S the 
set of all x G O at which ft, is smooth is of full measure in O. 

It is enough to show that for each h G S the equality \h\ x — \h\ x+v 
holds, for all x, x + v G O. Let us fix a vector h G 5. Another 
application of Fubini's theorem tells us, that for almost all x G O the 
vector h is smooth at almost each point of the linear interval rj(t) = 
x + tv (contained in O). By continuity of the Finsler structure it 
is enough to show that fit) = \h\ x+tv does not depend on t if h is 
smooth at almost each point of r]. The question is local in t, hence 
it is enough to show that f(t) is constant for all small t. We may 
assume (replacing h by its multiple) that x + h is contained in O. Then 
fit) = d(x + tv,x + tv + h), in particular, the function / is locally 
Lipschitz. Hence it is enough to show that f'(t) = for almost all t. 
Therefore our result is a consequence of the following lemma. 

Lemma 5.1. Let x be a point in O, h,v G M n . Assume that h is a 
smooth vector in the norm | • \ x . Then for the function f(t) = \h\ x+tv 
we have f'(0) = 0. 

Proof. Replacing h by its multiple we may assume that y := x + h is 
contained in O. Our claim is |ft| x +t„ = \h\ x + o(t), where o(t) is some 
function with lim t ^ ^ = 0. Since \h\ x+tv = d(x + tv,y + tv) we just 
have to find right first variation formulas in the Finsler manifold O. We 
refer to Section 9 in |Lyt04| for a general discussion of first variation 
formulas. We need some notations. For a vector w G M. n a point z G O 
we will denote by h z w : M" — > R the Busemann function of the direction 
w in the norm | • | 2 , i.e. b z w {v) = lim.t->oo(\tw — v\ z — \tw\ z ). We refer to 
Subsection 12.31 for basic properties of such functions. 
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First we are going to show that b% = b\. In order to do so we have 
to study the first derivative of the function f(t) = d(y,x + tv). Recall 
that \h\ x = \h\ y . 

Due to d(y, x + tv ) = \y — x — tv\ y = \h — tv\ y and properties of the 
Busemann functions (see Subsection 12. 3|) we have 

(1) d(y,x + tv) = \h\ y + tb v h {v) + o(t). 

On the other hand we can estimate d(y, x + tv) by working in a small 
neighborhood of x. Fix s ^> 1. For < t <C - we have d(y, x + tv) < 
d(y, x + tsh) + d(x + tsh, x + tv ). Since the metric on O is given by a 
continuous Finsler structure, we have d(x+tsh, x+tv) = t\sh—v\ x +o(t) 
(i.e. (M n , | • is the tangent space of O at x in the sense of metric 
geometry, |Lyt04| ). We obtain d(y, x + tv) < \h\ x (l — ts)+t\sh — v\ x + 
o(t). If s is large enough \sh — v\ x is very close to s + bf(v). Letting s 
go to infinity we conclude, that 

(2) d(y,x + tv) < \h\ x + tb x h+ (v) + o(t) for t > 0. 

Consider d(y, x + tv) + d(x +tv,x — tsh) > d(y, x — tsh) = \h\ x (l — ts) 
for a fixed big s and let t go to + . Then as above we obtain 

(3) d(x + tv, y) > \h\ x - tb x _ h {y) + o{t) for t > 0. 

By our assumption vector h is smooth at x, thus b x _ h {y) = —b^(v). 
Combining (1), (2), (3) above we deduce that b%(v) = b y h {v). Since the 
above calculations work for all v € M n we get b x l = b\. In particular, h 
is also smooth in | • 1^ and we have b x _ h = b y _ h . Now we can finish the 
proof of Lemma f5. 11 Arguing as in the proof of (2) and (3) above. We 
take a large s and estimate the distance d(x + tv,y + tv) from above 
(for all small < t < ±) by 

d(x + tv,y + tv) <d(x + tv,x + tsh) + d(x + tsh, y — tsh) 

+ d(y — tsh, y + tv) 

Due to the continuity of the Finsler structure we have 

d(x + tv, x + tsh) = t\v — sh\ x + o(t) 

and d(y — tsh, y + tv) = t\v + sh\ y + o(t). 

Thus we conclude 

d(x + tv,y + tv) < t\v - sh\ x + \h\(l - 2ts) +t\v + sh\ y + o(t). 

If s goes to infinity then \v — sh\ x — s converges to b^(v) and \v + sh\ y — s 
converges to b y _ h (v). Thus we arrive at: 

(4) d(x + tv,y + tv) < \h\ + tb x h {y) + tb v __ h {v) + o(t). 
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Now we take a large s and estimate d(x + tv,y + tv) from below by 

d(x + tv,y + tv) >d(x — tsh, y + tsh) — d(x — tsh, x + tv) 

— d(y + tv,y + tsh) 

As above we derive from it 

(5) d(x + tv,y + tv) >\h\- tb x _ h (v) - tb v h {y) + o{t). 

Since we know that b\ = b\ = —b y _ h = —b x _ h we can combine (4) and 
(5) and obtain d(x + tv,y + tv) = d(x, y) + o(t). Thus we have shown 

r (o) = o. □ 

This finishes the proof of Theorem 11.31 
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